Failures of the One-Step Learning Algorithm
نویسنده
چکیده
The Hinton network (Hinton, 2001, personal communication) is a deterministic mapping from an observable space x to an energy function E(x;w), parameterized by parameters w. The energy defines a probability P (x|w) = exp(−E(x;w))/Z(w). A maximum likelihood learning algorithm for this density model takes steps ∆w ∝ −〈g〉0+ 〈g〉∞ where 〈g〉0 is the average of the gradient g = ∂E/∂w evaluated at points x drawn from the data density, and 〈g〉∞ is the average gradient for points x drawn from P (x|w). If T is a Markov chain in x-space that has P (x|w) as its unique invariant density then we can approximate 〈g〉∞ by taking the data points x and hitting each of them I times with T , where I is a large integer. In the one-step learning algorithm of Hinton (2001), we set I to 1. In this paper I give examples of models E(x;w) and Markov chains T for which the true likelihood is unimodal in the parameters, but the one-step algorithm does not necessarily converge to the maximum likelihood parameters. It is hoped that these negative examples will help pin down the conditions for the one-step algorithm to be a correctly convergent algorithm. The Hinton network (Hinton, 2001, personal communication) is a deterministic mapping from an observable space x of dimension D to an energy function E(x;w), parameterized by parameters w. The energy defines a probability P (x|w) = exp(−E(x;w)) Z(w) , (1) where Z(w) = ∫ dx exp(−E(x;w)) (2) is the hard-to-evaluate normalizing constant or partition function. A maximum likelihood learning algorithm for this density model takes steps ∆w ∝ −〈g〉0 + 〈g〉∞ , (3) where 〈g〉0 is the average of the gradient g = ∂E/∂w evaluated at points x drawn from the data density, and 〈g〉∞ is the average gradient for points x drawn from P (x|w). If T is a Markov chain in x-space that has P (x|w) as its unique invariant density then we can approximate 〈g〉∞ by taking the data points x and hitting each of them I times with T , where I
منابع مشابه
The effect of emotion regulation training in decreasing emotion failures and self-injurious behaviors among students suffering from specific learning disorder(SLD)
Background: A great deal of attention has been given to the study of learning disorders. Hence, the aim of this research was to study the effect of emotion regulation training in decreasing emotion failures and self-injurious behaviors among students suffering from specific learning disorder. Methods: This was an experimental study with the pre-test, post-test and a control group. Research p...
متن کاملDynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)
In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...
متن کاملWavelet Neural Network with Random Wavelet Function Parameters
The training algorithm of Wavelet Neural Networks (WNN) is a bottleneck which impacts on the accuracy of the final WNN model. Several methods have been proposed for training the WNNs. From the perspective of our research, most of these algorithms are iterative and need to adjust all the parameters of WNN. This paper proposes a one-step learning method which changes the weights between hidden la...
متن کاملA Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf
Evaporation phenomena is a effective climate component on water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...
متن کاملA Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf
Evaporation phenomena is a effective climate component on water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...
متن کاملOnline Streaming Feature Selection Using Geometric Series of the Adjacency Matrix of Features
Feature Selection (FS) is an important pre-processing step in machine learning and data mining. All the traditional feature selection methods assume that the entire feature space is available from the beginning. However, online streaming features (OSF) are an integral part of many real-world applications. In OSF, the number of training examples is fixed while the number of features grows with t...
متن کامل